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CRANFIELD  CONFERENCE  ON  INFORMATION  RETRIEVAL 


Introduction 


■rtia  Third  Cranfield  Conference  on  Mechanised  Information 
Storage  and  Retrieval  Systems  was  held  on  20-23  July  1971  at  the 

Instltute  of  Technology,  in  Bedfordshire,  England.  Over 
200  information  scientists,  documentation  specialists,  cataloguers,  and 
librarians  attended  the  bi-annual  conference  which  is  perhaps  the  most 
important  international  meeting  on  information  retrieval.  The  reason 
it  is  held  in  Cranfield  is  that  the  conference  organiser,  C.  w.  Cleverdon, 
an  internationally-recognired  pioneer  in  information  processing,  is  the 
director  of  the  Institute  library.  Largely  through  Cleverdon 's  effort*, 
the  program  included  invited  talks  given  by  many  of  the  world's  leading 
peciaiists  in  information  retrieval  and  documentation  including  Professor 
A.  Latarow  of  the  University  of  Minnesota,  Dr.  E.  Garfield  of  the  Institute 
for  Scientific  Information,  Dr.  c.  j.  Bell  of  IBM  (UK) ,  and  Dean  B.  M. 

University  of  Indiana,  of  the  attendees,  most  came  from 

countri8B  ”>P**««nted  were:  Czechoslovakia,  Denmark, 
Sweden,  the  Netherlands,  Belgium,  Australia,  Canada,  Luxembourg,  west 
Germany,  France,  Japan,  Ireland,  Switzerland,  Yugoslavia,  and  the  United 
Stataa.  Most  of  the  speakers  were  either  British  or  American,  and  th* 
official  Conference  language  was  English. 


Most  of  the  talks  described  on-going  projects,  seme  of  which 
are  quite  specialized  applications  of  information  retrieval.  Other 

described  the  use  of  magnetic  tape  services  such  as  MEDLARS  and 
OOMPENDEX.  A  number  of  talks  described  on-going  efforts  for  on-line 

”,tri8Val-  At  *••»*  forty-five  minutes  were  allocated  to 
each  talk  and  parallel  sessions  were  kept  to  a  minimus. 


<*t  Wednesday,  the  professional  association  for  library  sciences 
and  documentation  specialists,  ASLIB,  held  a  one-day  conference  of  their 
computer  applications  group.  Registrants  to  the  main  conference  were 
^  attend  the  ASLIB  meeting  also.  The  papers  presented  at  the 
ASLIB  Conference  tended  to  emphasize  more  pragmatic  issues  dealing  with 
the  use  of  computers  in  the  overall  operation  of  the  library.  To  an 
outside  observer,  however,  the  two  conferences  were  so  similar  in  intent 
that  they  cpqld  easily  have  been  integrated  into  one. 


n»e  conference  was  well-planned  and  efficiently  run.  Cranfield 
was  an  excellent  place  to  hold  it.  Accommodations  were  costfortable,  the 
surroundings  quiet  and  beautiful,  and  the  food  was  excellent. 
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Contributed  Papers 

D.  B.  McCam ,  "Network* ,  with  Emphasis  on  Planning  and  On-line 
Bibliographic  Access  Systems." 

McCam,  who  is  with  the  Lister  Hill  National  Canter  for  Biomedical 
Communications,  gave  a  tutorial  paper  on  computer  networks.  His  talk 
covered  the  advantages  of  networks,  different  configurations,  transmission 
speeds,  and  decentralization  versus  communications  costs.  As  an  example, 
he  described  the  planning  for  a  national  on-line  medical  bibliographic 
service.  The  service,  known  as  AIM  TWX ,  provides  on-line  search  of  all 
items  in  the  index  of  the  Abridged  Index  Medicus  for  the  past  five  years. 
Users  access  the  central  data  bases  in  the  IBM  360/67  computer  at  SDC, 
Santa  Monica  via  TWX  or  teletype  terminals.  McCam  estimates  that  60% 
of  the  users  arc  physicians.  Each  search  averages  10  sec  vs  3-6  weeks 
for  a  conventional  batch -processing  search  using  the  MEDLARS  system  of 
the  National  Library  of  Medicine.  The  cost  of  each  search  is  approxi¬ 
mately  $4.00  with  $3.00  for  coaimuni cations  and  $1.00  for  computer  time. 
When  fully  operational  in  1972,  the  system  is  designed  to  handle  about 
180,000  requests  per  year. 

A.  Lazarow,  W.  J.  Schoener,  E.  H.  Brekhus,  J.  L.  Izzo,  K.  C. 
Goodman,  and  K.  Kreuger  "The  Diabetes  Literature  Index  and  the  Developing 
Minnesota  Diabetes  Information  Center." 

In  one  of  the  finest  papers  presented  at  the  Conference,  Lazarcw 
described  the  Diabetes  Literature  Index  work  of  the  National  Institute 
of  Arthritis  and  Metabolic  Diseases  (NIAMD) .  He  described  the  process 
whereby  diabetes-related  literature  is  derived  from  the  National  Library 
of  Medicine's  MEDLARS  magnetic  tapes  which  is  a  regularly  updated  index 
of  all  current  medical  literature.  Approximately  two-thirds  of  the 
diabetes -related  citations  are  automatically  identified  by  computer  via 
a  key-word  index.  The  remaining  one-third  is  manually  selected  by  an 
experienced  MD.  The  final  derived  magnetic  tape  for  the  diabetes  index 
contains  the  complete  bibliographic  citation  indexed  by  key  words. 

Lasarow  strongly  believes  that  the  complete  citation  is  much  more  useful 
than  a  fragmented  title  and  saves  valuable  user's  time  at  the  cost  of 
slightly  greater  computer  processing  time.  Every  month,  the  diabetes 
index  tape  is  sent  to  the  Government  Printing  Office  where  the  Diabetes 
Literature  Index  is  printed.  The  project  is  a  prime  example  of  effective 
utilization  of  cceqmters  in  information  retrieval. 

w.  M.  Woods, "Design  and  Development  of  a  Trans  disciplinary 
Engineering  Information  Program." 
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B.  Hisinger,"A  Multidisciplinary  and  Computerized  SDI  Service 
for  Industry  and  Research  -  Practical  Experience  and  Costs." 

M.  E.  williams,  "Experiences  of  XIT  Research  Institute  in  Operating 
Computerised  Retrieval  System  for  Searching  a  Variety  of  Data  Bases.” 

J.  Byrne,  P.  J.  Currivan,  and  P.  V.  Mahon  "A  Current  Awareness 
System  Based  upon  INSPEC  Tapes." 

The  above  four  papers  all  deal  with  information  services  on 
magnetic  tape,  rtiese  services  send  to  their  clients  each  month  a  magnetic 
tape  with  thousands  of  abstracted  items  containing  subject  headings, 
cross  references,  bibliographic  citations  and  abstracts. 

An  information  service  that  was  discussed  in  three  of  the 
above  papers  was  the  COMPENDEX  (Computerized  Information  Index)  tape 
service  which  was  established  by  Engineering  Index,  Inc.  in  1969. 

COMPENDEX  offers  its  subscribers  a  package  consisting  of  a  monthly 
magnetic  tape  service  with  more  than  6000  items,  and  the  monthly  and  the 
annual  Engineering  Index.  B.  M.  Woods,  managing  director  of  Engineering 
Index  described  how  COMPendex  works. 

B.  Hisinger  of  the  Technical  Library  of  Denmark  gave  a  talk  on  a 
computerized  SDI  (Selective  Dissemination  of  Information)  service  based 
upon  COMPENDEX  and  INSPEC  (a  tape  service  offered  by  IEE)  tapes.  In 
another  paper  that  dealt  with  COMPENDEX,  Miss  M.  E.  Williams  described 
a  current  awareness  (SDI)  service  developed  at  the  Illinois  Institute 
of  Technology  Research  Institute.  In  addition  to  COMPENDEX,  the  SDI 
service  also  reviews  Chemical  Abstracts  Condensates  and  Biological  Abstracts 
Previews .  In  operation  since  1969,  the  service  also  offers  retrospective 
searches  tailored  to  a  user's  or  organization's  needs. 

For  COMPENDEX  services,  the  yearly  leasing  cost  is  $7300, 
broken  down  as  follows <  $6500.-  for  the  COMPENDEX  data  bass,  $500  - 

for  the  monthly  subscription  to  Engineering  Index,  and  $300  -  for  the 
magnetic  tapes  themselves.  Although  individual!!  probably  can  not  afford 
the  COMPENDEX  service ,  institutions  with  a  fairly  substantial  library 
budget  may  be  able  to. 

D.  E.  Garfield  and  A.  E.  Cawkell,  "The  Relationship  Between  Large 
Multidisciplinary  Systems  and  Specialized  Information  Services." 

In  one  of  the  most  hotly  debated  papers,  D.  E.  Garfield,  President 
of  the  Institute  for  Scientific  Information,  and  a  pioneer  in  Information 
Retrieval  gave  a  talk  on  citation  indexes.  He  showed  how  citation  indexes 
give  valuable  feedback  on  the  relative  importance  of  a  particular  paper 
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or  journal  in  tarns  of  tha  msnbar  of  tinas  tha  paper  or  journal 
has  bean  cited  in  othar  papars.  In  advance  of  his  talk,  Garfield 
distributed  an  alphabetic  listing  of  tha  100  most  frequently  citad 
scientific  journals.  This  listing  ranks  scientific  journals  ac¬ 
cording  to  two  standards  -  rank  by  total  citations,  and  rank  by  impact 
factor.  The  term  Impact  factor  is  defined  as  tha  ratio  of  total 
citations  to  total  source  itams  over  a  given  period.  Thus,  impact 
factor  is  a  normalised  criterion  in  which  the  sise  of  the  journal, 
i.e.,  its  total  number  of  pages,  does  not  enter  in,  and  perhaps  is  a 
more  relevant  index  than  total  citations.  For  tha  reader's  infor¬ 
mation,  the  ten  most  frequently  cited  scientific  journals  are  Hated 
below.  The  complete  list  is  to  be  published  in  Science  (number  seven 
in  total  citations) .  It  should  be  noted  that  Garfield's  complete  list 
includes  over  a  thousand  journals,  all  ranked  by  total  citation  and 
impact  factor. 

Rank  by  Rank  by 

JOURNAL  Total  Citation  Impact  Factor 


J.  AM  CHEM  SOC 

1 

6 

PHYS  REV 

2 

18 

J  BIOL  CHEM 

3 

4 

NATURE 

4 

48 

J  CHEM  PHYS 

5 

26 

J  CHEM  SOC 

6 

31 

SCIENCE 

7 

34 

BIOCHEM  BIOPHYS  ACTA 

8 

29 

P  NAT  ACAM  SCI  USA 

9 

2 

BIOCHEM  J 

iO 

30 

The  journal  ranked  first  in  impact  factor,  J  Mol  Biol,  ranked 
eighteenth  in  total  citations.  From  the  listing  of  tha  100  most 
frequently  cited  journals  we  can  make  the  following  observations: 

(1)  All  the  journals  listed  relate  to  science  or  medicine.  Not  a 
single  journal  in  engineering  or  technology  appeared  in  the  top  100. 

This  does  not  mean  that  their  impact  factor  is  not  high,  but  does  mean 
that  engineering  journals  are  rather  too  specialised  to  be  cited  widely. 

(2)  Most  of  the  journals  in  the  top  100  are  in  the  English  language, 
which  gives  some  support  to  the  thesis  that  English  is  tha  international 
scientific  language,  rather  than  French,  Russian  or  German.  (3)  Review 
journals,  such  as  the  Rev  Mod  Phys  which  ranks  97  in  total  citations  and 
12  in  impact  factor,  usually  have  high  impact  factors.  This  is  because 
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■oat  scientists  do  not  hava  enough  tine  to  keep  up  with  tha  literature, 
a van  in  thair  own  specialisation,  and  auat  increasingly  raly  on  raviaw 
articlaa  for  inforaatioo. 

Garfiald  waa  challenged  rather  aharply  in  hia  total  dedication 
to  citation  indaxaa  aa  the  only  acceptable  Mans  for  information 
analyaia.  He  atood  hia  ground,  however,  and  inaietad  that  hia  nuebere 
are  valid.  Hia  concluding  remark ■  were  that  in  the  information  ex¬ 
plosion  we  are  living  in  today  in  which  the  number  of  journals  and 
number  of  journals  pages  are  increasing  almost  exponentially,  a  small 
niaaber  of  journals,  a ay  200  account  for  a  substantial  percentage 
(30%)  of  the  total  citations.  Therefore,  even  with  new  journals 
cropping  up  everyday,  the  old  standards,  such  as  Solanos,  remain  as 
important  as  ever. 

Garfield's  talk  served  to  underline  the  importance  of  information 
retrieval  in  an  age  where  too  much  information  is  disseminated  for 
people  to  absorb.  It  also  showed  tha  importance  of  meetings  such  as 
tha  Cran field  Conference. 

S.  Adams  "The  Hole  of  Reprocessing  in  ONI8IST. " 

UNISIST  is  an  acronym  which  stands  for  a  feasibility  study  on 
tha  establishment  of  s  world  science  information  system.  The  study  was 
jointly  sponsored  by  the  United  Nations  Educational,  Scientific  and 
Cultural  Organisation  (UNESCO)  and  the  International  Council  of  Scien¬ 
tific  Unions  (ICSO).  The  study,  reported  in  e  monograph  entitled  UNI SIT 
(UNESCO,  Paris  1971) ,  shows  that  it  is  both  desirable  and  fa as ib la 
for  developing  nations  to  have  information  services,  and  oslls  for  the 
sharing  of  information  coats  and  services  between  developed  and  develop¬ 
ing  nations.  Other  recommendations  osll  for  the  establishment  of 
governmental  policies  favoring  such  interchange,  end  the  experimental 
interlinking  of  information  systems  between  developing  and  industrialised 
countries.  Adams  suggests  that  with  tha  implementation  of  information 
sharing  programs,  national  reprooasaing  canters ,  i.a. ,  centers  for  the  re¬ 
packaging  of  information  from  date  bases  such  as  OOKPENDBX,  should  be 
crested.  The  talk  wee  highly  speculative  and  it  was  difficult  to  asaeaa 
what  spadfic  actions  Adams  reoommsndsd. 

it  should  be  noted  that  e  UNESCO  conference  is  being  held  at 
this  time  (1  October  1971)  in  Peris  to  discuss  snd  implement  the 
recommendations  given  in  UNISIST. 

A  list  of  papers  appears  in  Appendix  A. 
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LIST  OF  PAPERS 


D.  B.  McCami  Natworks,  with  emphasis  on  planning  an  on-line  biblio¬ 

graphic  access  system 

W.  N.  Hoods i  Design  and  development  of  a  transdisciplinary  snginsering 
information  program 

E.  H.  Brenner i  Interactions  betwean  distributors  of  discipline-oriented 

information,  industry  information  scientists,  and  the 
ultimate  users 

J.  R.  Kelly  end  P.  Popper:  Anglo-American  cooperation  in  a  company 
information  system 

S.  Keenan:  Recent  developments  in  the  production  of  computer  based 
services  and  prospects  for  the  1970 ’s 

J.  Newton:  Development  of  computer  based  retrieval  services  for  the 
Intemetion  Food  Information  service 

N.  Williams:  Experiences  of  IXT  Research  Institute  in  operating  a 

computerised  retrieval  system  for  searching  a  variety  of 

date  beaee 

E.  J.  Scott,  H.  M.  Townley  and  B.  T.  Stem:  A  technique  for  the  evaluation 

of  a  commercial  information  service  end  some  preliminary 
results  free:  the  Drug  doc  service  of  Excerpta  Medics 
Foundation 

F.  H,  Barker,  D.  C.  Veal  and  B.  K.  Wyatt:  Towards  the  eutosutic  con¬ 

struction  of  search  profiles. 

F.  H.  Barker,  d,  c.  Veal  and  B.  K.  Wyatt:  Comparative  efficiency  of 
searching  titles,  abstracts  and  indsx  terms  in  e  f tee- 
text  data  base 

A.  Herts:  state  information  policy  in  Czachoslovakia  in  the  field  of 

science  and  technology 

B.  Hiainger:  A  multidisciplinary  and  computsrisad  SDI  servioa  for 

industry  and  research  -  practical  experience  and  costs 

J.  Martyn:  Serving  interdisciplinary  need-groups  from  basic  abstracting 
sources 


6 


c-ae-71 


APPENDIX  A 


A.  Lata row,  w.  J.  Schoensr,  B.  H.  BreXhus,  J.  L.  Ixso,  K.  C.  Goodtown, 

and  K.  Kreuger:  Tha  diabetes  lltaratura  index  and  tha  da va loping 
Mlnnaaota  Diabataa  Information  Can tar 

N.  Stutsman  and  E.  da  Hartt  Ccatputar-basad  htmanitiea  rafaranoa  service 
and  tha  uni  varsity  library 

P.  G.  Wat* on:  Computerised  information  aarrioaa  for  tha  University 
cossmmlty 

C.  E.  Elwim  Coma  a  a  pa  eta  on  integration  of  information  aarvicas  in 
smdieine,  biology  and  ehamiatry  at  a  atultidiscipllnary 
documentation  oantar 

C.  J.  Ball  and  T.  w.  Rogers:  Adaptability  to  change  in  large  data  base 
retrieval  ay a tarn* 

E.  Garfield  and  A.  X.  Cavkall :  The  relation ah ip  between  large  multi¬ 
disciplinary  systems  and  apadalixad  information  service 

A.  E.  Negus  and  J.  L.  Hall:  toward a  an  affective  on-lins  rafaranoa 
ratrlaval  systam 

L.  D.  Higgins,  M.  Carvilla  and  F.  J.  Smiths  Interactive  rafaranoa 
ratriaval  in  larga  filaa 

L.  H.  Thiel  and  H.  S.  Heaps :  Program  design  for  retrospective  searches 
on  larga  data  bases. 

C.  R.  Clough  and  K.  M.  Bramwell:  A  single  computer  based  system  for  both 
currant  awareness  and  retrospective  search :  operating  ex¬ 
perience  with  'ASSASSIN* 

T.  M.  Aitchison  and  H.  D.  Martin:  Design  and  development  of  a  tape 
service 

J.  Byrne,  P.  J.  Cur  riven  and  F.  V.  Mahon:  A  currant  «w*nnesa  systam 
based  on  INSPEC  tapes 

w.  Simmler:  A  documentation  systam  for  inorganic  and  physical  chemistry 

C.  D.  Green:  Problems  of  indexing  and  ratrlaval  for  multidisciplinary 
information  fields 


7 


C-28-71 


APPENDIX  A 

'cott  Adams:  The  role  of  reprocessing  in  UNISIST 

B.  K,  Fry:  New  management  concepts  for  inter library  cooperation 

H.  F.  Dammers  (Shell  Research  Ltd.,  Sittingbourne) :  Introduction:  The 
work  of  the  As lib  Computer  Applications  Group 

A.  Evans  (Loughborough  University  of  Technology) :  Developments  in  serial 
control  systems 

R.  M.  Duchesne  (Bath  University  of  Technology):  Survey  of  automation  in 
university  libraries 

c.  W.  J.  Mil  on  (AERE,  HARWELL):  Trends  in  automatic  circulation  control 
systems 

R.  Coward  (British  National  Bibliography) :  The  B.N.B.  Host  Sussex  MARC- 
based  acquisition  system 

T.  M.  Aitchison  (INS^EC) :  Criteria  for  computer  output  in  information 
systems 

L.  Corbett  (AWRE,  Aldermaston) :  Magnetic  tape  data  bases:  Availability 
and  search  programs 


8 


